3-base periodicity in coding DNA is affected by intercodon dinucleotides
نویسنده
چکیده
All coding DNAs exhibit 3-base periodicity (TBP), which may be defined as the tendency of nucleotides and higher order n-tuples, e.g. trinucleotides (triplets), to be preferentially spaced by 3, 6, 9 etc, bases, and we have proposed an association between TBP and clustering of same-phase triplets. We here investigated if TBP was affected by intercodon dinucleotide tendencies and whether clustering of same-phase triplets was involved. Under constant protein sequence intercodon dinucleotide frequencies depend on the distribution of synonymous codons. So, possible effects were revealed by randomly exchanging synonymous codons without altering protein sequences to subsequently document changes in TBP via frequency distribution of distances (FDD) of DNA triplets. A tripartite positive correlation was found between intercodon dinucleotide frequencies, clustering of same-phase triplets and TBP. So, intercodon C|A (where "|" indicates the boundary between codons) was more frequent in native human DNA than in the codon-shuffled sequences; higher C|A frequency occurred along with more frequent clustering of C|AN triplets (where N jointly represents A, C, G and T) and with intense CAN TBP. The opposite was found for C|G, which was less frequent in native than in shuffled sequences; lower C|G frequency occurred together with reduced clustering of C|GN triplets and with less intense CGN TBP. We hence propose that intercodon dinucleotides affect TBP via same-phase triplet clustering. A possible biological relevance of our findings is briefly discussed.
منابع مشابه
تخمین مکان نواحی کدکننده پروتئین در توالی عددی DNA با استفاده پنجره با طول متغیر بر مبنای منحنی سه بعدی Z
In recent years, estimation of protein-coding regions in numerical deoxyribonucleic acid (DNA) sequences using signal processing tools has been a challenging issue in bioinformatics, owing to their 3-base periodicity. Several digital signal processing (DSP) tools have been applied in order to Identify the task and concentrated on assigning numerical values to the symbolic DNA sequence, then app...
متن کامل3 - , 10 . 5 - , 200 - and 400 - base periodicities in genome sequences
The above periodicities are the main hidden oscillating patterns detected so far in the genomic sequences. The 3-base periodicity is characteristic for the protein-coding sequences only. The source of the approximately 10.5-base sequence period is twofold. On the one hand, the sequences coding for alpha-helical coiled-coil regions in proteins have the hidden 3.5 aminoacid repeat which appears a...
متن کاملHeterogeneous periodicity of drosophila mtDNA: new refutations of neutral and nearly neutral evolution.
We found a consistent 3-site periodicity of the X²9 values for the heterogeneity of the distribution of the second base in relation to the first base of dinucleotides separated by 0 (contiguous), 1, 2, 3 ... 17 (K) nucleotide sites in Drosophila mtDNA. Triplets of X²9 values were found where the first was over 300 and the second and third ranged between 37 and 114 (previous studies). In this st...
متن کاملInternucleotide correlations and nucleotide periodicity in Drosophila mtDNA: new evidence for panselective evolution.
Analysis for the homogeneity of the distribution of the second base of dinucleotides in relation to the first, whose bases are separated by 0, 1, 2,... 21 nucleotide sites, was performed with the VIH-1 genome (cDNA), the Drosophila mtDNA, the Drosophila Torso gene and the human p-globin gene. These four DNA segments showed highly significant heterogeneities of base distributions that cannot be ...
متن کاملPrediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence.
With the exponential growth of genomic sequences, there is an increasing demand to accurately identify protein coding regions (exons) from genomic sequences. Despite many progresses being made in the identification of protein coding regions by computational methods during the last two decades, the performances and efficiencies of the prediction methods still need to be improved. In addition, it...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 6 شماره
صفحات -
تاریخ انتشار 2011